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ABSTRACT.  “'The  statistical  efficiency  of  designs  associated  with 
the  linear  model  is  by  and  large  measured  solely  on  the  basis  of 
the  information  matrices.  It  is  shown  that  when  data  from  two 
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experiments  with  the  same  model,  which  might  contain  nuisance 
parameters,  but  with  possibly  different  design  matrices  Are  com- 

blned,  then  the  resulting  information  matrix  is  larger  (in  the 
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sense  of  nonnegatlve  definiteness)  than  the  sum  of  the  individual 
information  matrices.  Cases  where  equality  is  achieved  are  com- 
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pletely  characterized  geometrically  and  statistically.  Conditions 

/  '  . 

where  the  best  linear  unbiased  estimator  of  estimable  functions 

are  obtained  as  linear  combination  of  the  best  linear  unbiased 
estimators  of  the  same  function  from  the  individual  experiments 


are  determined. 
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ON  THE  SUPPERADDITIVITY 
OF  INFORMATION  MATRICES 
IN  GAUSS -MARKOV  MODELS 

By 

A.S.  Hedayat  and  Dibyen  Majumdar 

Department  of  Mathematics,  Statistics 
and  Computer  Science 

University  of  Illinois,  Chicago 

1.  INTRODUCTION 

Scientists  perform  experiments  to  gather  information  about 
some  underlying  phenomenon.  What  constitutes  information  largely 
depends  on  the  problem  at  hand  and  the  mode  of  Inference  to  be  made. 
However,  one  thing  is  obvious  that,  no  matter  what  the  definition 
of  information,  it  should  be  nondecreasing  in  the  number  of  obser¬ 
vations.  Here,  we  shall  exclude  from  our  consideration  cost  and 
any  other  nonstatistical/mathematlcal  matter.  For  example,  the 
notion  "amount  cf  information  per  $"  is  of  no  consideration  to  us. 
With  this  philosophy  scientists  working  on  identical  problems 
should  combine  their  data  for  the  purpose  of  analysis  and  inference. 

There  are  various  notions  of  information  which  have  found 
popularity  amoung  scientists.  Before  adopting  any  such  notion 
one  has  to  establish  the  minimum  requirement  that  it  is  nondecreas¬ 
ing  in  the  number  of  observations.  Statisticians  dealing  with 
experiments  whose  data  follow  the  linear  model,  by  and  large,  have 
adopted  the  inverse  of  the  variance  as  the  amount  of  information. 

Or  more  generally,  the  "smaller"  the  variance-covariance  matrix 
the  more  information  has  been  obtained.  Equivalently,  the  "bigger" 
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the  information  matrix  (the  inverse  of  the  variance -covariance 
matrix)  the  more  information  is  provided.  Here,  we  have  adopted 
the  following  concept.  If  A  and  B  are  two  nonnegative  definite 
matrices  then  we  say  A  is  at  least  as  large  as  B  (written  as 
A  a  B)  if  A-B  is  a  nonnegative  definite  matrix.  A  is  bigger 
than  B  (A>B)  if  A-B  is  positive  definite. 

As  we  noted  earlier  if  and  Dg  are  two  sets  of  data 

for  the  same  problem  they  should  be  combined  for  the  purpose  of 
analysis  and  inference  since  Information,  /,  provided  by  U  Dg 

is  as  large  as  or  .fg,  the  information  provided  by  and 

Dg  respectively.  Thus  in  general  S  a  S ^  ,i  ■  1,2.  In  view  of 
the  above  a  natural  question  is  this.  Are  there  settings  for 
which  *Me  can  relate  /  to  f  ^  +  /g.  This  paper  answers  this 
question  in  affirmative.  We  establish  that  for  the  classical 
linear  model  there  is  a  s upperadditivity  for  the  information 
matrix  of  the  combined  data,  i.e.,  S  a/^  +  Sg  whether  or  not 
there  are  nuisance  parameters.  We  prove  this  in  Section  3.  In 
this  section  we  also  give  both  statistical  and  geometrical  char¬ 
acterizations  of  cases  in  which  /  ■  f  ^  +  /g. 

In  Section  4  we  study  the  status  of  Gauss -Markov  theorem  _ . 

for  the  combined  data  and  shall  explore  its  relation  to  associated  A 
Gauss -Markov  theorems  for  the  individual  sets  of  data.  Expliclty, 
we  provide  answers  to  the  following  questions.  (1)  Given  a 
linear  parametric  function  p'6^  (flg  Is  the  vector  of  nuisance 
parameters)  which  Is  estimable  under  both  sets  of  data,  then  under 
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what  conditions  can  the  BLUE  of  p'0^  under  the  combined  data  be 
expressed  as  a  linear  combination  of  BLUES  of  p'9^  under  indi¬ 
vidual  sets  of  data,  (li)  The  same  question  as  in  (1)  except 
that  we  insist  the  same  conclusion  be  true  for  all  estimable 
functions.  Besides  characterization  of  involved  cases  we  give 
explicit  forms  of  the  linear  functions  of  BLUEs  so  that  we  can 
save  some  computational  time  in  case  for  each  set  of  data  the 
BLUEs  have  already  been  computed . 

In  Section  5  we  indicate  how  the  results  in  Sections  3  and  4 
could  be  utilized  in  the  area  of  optimal  design  of  experiments. 

We  give  examples  to  illustrate  our  results  throughout  the  paper. 
To  develop  our  theory  we  need  some  preliminary  results  which  we 
have  summarized  in  Section  1. 

2.  PRELIMINARY  RESULTS. 

In  this  section  we  shall  introduce  the  model,  some  notations 
and  state  some  known  and  not  so  well  known  results,  which  are 
needed  in  the  subsequent  sections. 

We  shall  be  using  the  classical  Oauss -Markov  model  through¬ 
out  this  paper.  This  may  be  described  as  follows: 

Y  -  Xiei  +  V2  +  e  .  (2.1) 

Here  Y  is  the  vector  of  observations .  X1  is  the  design  matrix 
associated  with  62,  which  is  a  vector  of  all  parameters  of 


4 


Interest.  Xg  is  the  design  matrix  associated  with  0g,  which 
Is  a  vector  of  nuisance  or  covarlate  parameters .  e  Is  the  usual 
error  vector  with  E(e)  =  0,  which  we  assume  to  he  homo seed as tic, 

p 

i.e.,  V(e)  a  a  I  ,  where  I  dentes  the  identity  matrix  whose  dimen¬ 
sion  will  he  clear  from  the  context. 

Throughout  we  adopt  the  following  notation:  For  a  matrix  A, 
-C(A)  will  denote  the  column  space  of  A,  r(A)  the  rank  of  A 
and  A"  a  generalized  inverse  of  A.  The  matrix  PA  will  denote 
the  orthogonal  projection  operator  onto  .11(A) .  More  explicitly, 

PA  «  A (A* A) "A'.  Orthogonality  for  us,  is  always  in  terms  of  the 
dot  product.  If  A,  B  and  C  are  square  matrices  then  the  block 
dlangonal  matrix 


will  he  denoted  hy  diag  (A,B).  Dlag  (A,B,C)  will  have  a  similar 
meaning. 

We  shall  now  state  and  prove  four  lemmas.  Lemmas  2.1  and  2.2 
describe  well  known  results  of  Gauss -Markov  models  in  a  form  suit¬ 
able  for  us.  Lemmas  2.3  and  2.4  are  purely  algebraic  in  nature. 


Lemma  2.1.  b'T  Is  the  BLUE  of  the  linear  parametric  function 
p'O  for  the  Gauss -Markov  model 


T-  »  +  «  »  E(«)  -  0  ,  V(  )  -  o2 1 

Iff  Z'b  •  p  and  b  <  JE(X). 


5. 


Proof.  b'Y  is  unbiased  for  p'0  iff  X'b  =  p.  Then,  by 
Lehmann -S c hef f e  Theorem,  b'Y  is  BLUE  of  p'0  iff 
COV(b'Y,  z'Y)  =  0  for  every  z  for  which  E(z'Y)  *  0  for  all  0. 
i.e.,  b'z  »  0  for  every  z  such  that  z'X  ■  0.  Note  that  the 
condition  of  the  Lemma  on  b  satisfies  the  Lehmann-Scheffe  con¬ 
dition  since  if  b  e  -C(X)  ,  then  obviously  b'z  *  0,  whenever 
z'X  =  0.  Conversely,  given  b  write  b  =  b^  +  bg,  where 
b1  e  /(X)  and  bgX  =  0.  Hence  by  Lehmann-Scheffe  theorem 

0  =  b'bg  =  b£bg  , 

which  implies  bg  =  0  ,  i.e.,  b  -  e  £(X) . 

Lemma  2.2.  (i)  b'Y  is  BLUE  of  the  parametric  function  p'0^ 

for  the  Gauss-Markov  model  (2.1)  iff 

Xjb  »  p  j  X£b  =  0  and  be  ^(XpXg) 

(ii)  b'Y  is  BLUE  of  E(b'Y)  for  the  model  (2.1),  iff 

b  e  ^[(I-Px  )X,] 

*2  A 

Proof.  (1)  Unbiasedness  of  b'Y  implies 

b'Xl9l  +  b<X292  "  p'el  '  for  a11  el»  ®2 
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Hence  X^b  ■  p  and  X£b  *  0  .  The  rest  of  (i)  follows  from 
Lemma  2.1.  Observe  that  X£b  =  0  iff  b  =  (I-P^ 

But  b  *  xihi  +  ^2h2  for  some  hi»h2’  since  13  €  --(X^Xg). 

Hence  b  =  (I-Px  =  (I_px  ^Xlhl  6  ^xi^*  Conversely  if 

b'Y  =  h'X^(l-Pj£  )Y  ,  then  Lehmann-Scheffe  Theorem  can  be  used 
exactly  as  in  Lemma  2.1  to  show  that  b'Y  is  BLUE  of  E(b'Y). 


Lemma  2.3.  Let  A  and  B  be  two  matrices  such  that 

Jt(A')  =  /(B').  Then  for  each  p  e  /(A'),  there  exists  a  vector 

h  and  a  scalar  a  (depending  on  p)  such  that 


A' A  h  =  ap 
B'B  h  -  (l-o)p 


(2.2) 


are  simultaneously  satisfied,  iff  B'B  =  X  A'A  for  some  \  >  0. 


Proof.  First  note  that  /(A')  =  /(A 'A)  and  Z(B')  »  /(B'B)  , 
so  p  €  /(A 'A)  *  /(B'B).  Also,  if  (2.2)  is  satisfied  for  some  h, 
for  a  given  non  null  p,  then  p'h  »  crp'  (A'A)“p  ■  (l-a)p' (B'B)"p. 
Solving  for  a,  we  obtain 


p ' (B  *B) ~p 

p'(A'A)“p  +  p'(B'B)“p 


(2.3) 


Hence  (2.2)  implies  that  a  is  of  the  form  (2.3).  We  shall  now 
prove  the  'only  if'  part  of  the  theorem.  It  is  well  known  that 


1  4 

X  , 


■m 


any  two  nonnegative  definite  matrices  can  be  simultaneosly  diago¬ 
nalized  by  a  single  nonsingular  matrix  (see  Rao  and  Mitra  (1971)* 
p.  122).  So,  let  T  be  a  nonsingular  matrix  such  that 

A 'A  =  T'  diag(D1,0)T  ,  B'B  a  T'  diag(D2,0)T  (2.4) 

for  positive  definite  diagonal  matrices  D1  and  Dg  of  the 
same  dimension  (say  r),  since  JC(A'A)  a  „?(B'B).  Equation  (2.2) 
may  be  written  as 

T'  dlag(D,  ,0)  T  h  »  otp 

(2.5) 

T'  diag(Dg,0)  T  h  *  (l-a)p 

Suppose  A 'A  and  B'B  are  mxm  matrices.  Partition 

T'  »  (T^  :  T£) 

where  T£  is  mxr  and  is  mxm-r.  Let  s  a  T  h , 

q  «  (T')'1?  «  Qp  ,  where  a  T'.  Corresponding  to  the  partition 
of  T,  partition  s'  a  (s£  :  s£)  ,  q'  a  (q£  :  q£)  ,  Q'  a  (Q|  :  Q£) .  The 
QT'  a  I  implies  *  1  »  *  0.  Since  by  (2.4) 

>€(A')  >  J5(T£)  ,  p  a  T£w  for  some  w.  Hence  q^  *  Q^p  a  0  and 
Q1  *  •  w.  So  as  p  varies  over  the  whole  of  JE(T^),  q^ 

varies  over  the  entire  euclidean  space  Rr.  Equation  (2.5) 


reduces  to 
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D1  S1  =  a  ql 
D2  =  (1-a)  qx 


(2.6) 


If  there  exists  h,a  such  that  (2.2)  is  satified  for  all 
p  e  A’)  ,  then  there  exists  s1  ,  a  such  that  (2.6)  is  satisfied 
for  all  e  Rr.  Given  q1  0  ,  it  follows  from  (2.6)  that 


i.e. , 


—  a  q^  —  ( 1— ot)  Dg  q^ 
DgD^1  qx  =  a"1(l-a)q1 


(2.7) 


Mote  that  a_1(l-a)  >  0  ,  since  from  (2.3)  0  <  a  <  1. 

Write  Dj  *  diag(dj1, . . .,djr)  ,  j  *  1,2.  Since  (2.7)  is  satisfied 

for  all  q1  e  Rr,  d2i^dli  =  a”1(1-a)  »  for  a11  i  “  l,...,r, 

by  choosing  q^  ■  (1,1, ...,l),  a  vector  of  one's.  Hence 

Dg  *  a“1(l-a)D1  ,  which  implies  B'B  =  X  A'A,  with  X  =  a_1(l-a). 

To  prove  the  if  part,  notice  that  if  B'B  «  XA'A  for  some 
X  >  0,  then  the  only  admissible  value  of  a  ,  by  equation  (2.3),  is 


X‘1(l  +  X"1) 


-1 


(1  +  X)"1 


(2.8) 


If  h  is  chosen  to  satisfy  A'A  h  *  ap  ,  then  B'Bh  =  XA'Ah  * 
Xap  ■  (l-a)p  ,  by  equation  (2.8).  Hence  (2.2)  is  satisfied 
whatever  p  e  /(A'). 

Let  A  and  B  be  two  matrices  with  the  same  number  of 
columns.  Then  the  following  is  well  known  (Rao(l965)  P.  3*0  *• 


9. 


.f(A'  )  PI  J5(B* )  ■  (0}  *  A'A(A'A  +  B'B)  ~  A  'A  =  A  'A  . 

In  the  following  Lemma  the  converse  is  also  established. 

Lemma  2.4.  The  following  are  equivalent. 

(i)  X(A')  n  £(B')  »  {0} 

(ii)  A'A(A'A  +  B'B)~A'A  =  A'A  for  some  g-inverse  of 
A  'A  +  B'B. 

Proof.  To  prove  (ii)  *  (i)  observe  that  if  (ii)  is  satisfied  for 
some  g-inverse  of  A'A  +  B'B  ,  then  it  is  satisfied  for  all 
g-lnverses.  Let  T  be  a  nonsingular  matrix  such  that 
A'A  »  T'  diag(D1,0,0)T,  B'B  »  T'  diag(Dg,D3,0)T  where 
are  diagonal  matrices.  and  Dg  are  of  the  same  dimension, 

D1  is  positive  definite.  Then  A'A  +  B'B  »  T'  diag(D1  +  Dg,D3>0)T. 
Choose  the  following  g-inverse 

(A'A  +  B'B)"  -  T"1  diag((D1+D2)‘1,  D"  ,  OjT'"1  . 

Then  (ii)  Implies 

diag(Dr0,0)  diag((D1+D2)'1,D3,0)  diag(Dlf0,0)  »  diag(Dlf0,0) 
i.e. ,  D^Dj+Dg)  ^ 
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i.e.. 


0 


Hence  if  T'  =  (T£  :  T£  :  T^)  in  the  partitioned  form,  then 

A‘A  »  T^D1T1  and  B'B-T^^Tg. 

Thus  „C(A ' A )  «  iI(Tp  and  „€(B'B)  c  il(T£)  .  Hence  (i)  follows. 

3.  A  LOWER  BOUND  ON  INFORMATION  MATRICES 

The  main  purpose  of  this  section  is  to  prove  that  the  concept 
of  information  in  the  context  of  linear  models  defined  in  the 
introduction  is  superadditive,  i.e.,  the  information  associated 
with  the  combination  of  two  experiments  is  at  least  equal  to  the 
sum  of  the  information  provided  by  individual  experiments.  We 
also  examine  the  statistical  and  geometrical  interpretation  of 
those  cases  where  equality  is  achieved.  Various  examples  are 
provided  to  elucidate  the  theory. 

Let  there  be  two  experiments  with  the  following  models: 

O 

Experiment  1:  ■  ^11^1  ^12^2  ^  ^1  *  *  0  ,  V(e^)  ®  c  I 

o 

Experiment  2:  Yg  *  *21®!  +  *22*2  +  €2  »  E(e2^  “  0  *  V(cg)  ■  o  I 

The  meaning  of  the  symbols  in  each  model  is  the  same  as  in  equa¬ 
tion  (2.1).  The  number  of  observations  in  experiment  1  is  n^ 
and  in  experiment  2  is  ng.  It  is  well  known  that  the  information 
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matrices  for  estimable  functions  of  81  for  the  first  and  second 
experiments  are  respectively 


^1  *  Xll^In1  "  xl2^xi2X12^  Xi2^Xll 
and 

*2  m  ^l^1^  *  'hsS'tythst)  ^^l  * 


The  term  information  matrix  stems  from  the  fact  that  the  variance 
of  the  BUJE  of  an  estimable  function  p'0^  is  proportional  to 
p'/£p  and  p'/gP  for  the  two  experiments .  Similarly  the  linear 
model  and  the  information  matrix  associated  with  the  n^  +  ng 
observations  obtained  from  combining  the  two  experiments  may  be 
written  as 


Y  . 


and 


/  - 


(Xii:^i)[In-(^)(Xi2Xi2  + 


,  E(e)  -  0  ,  v(€)  -  a2I 

X^gXgg)  (*i2  :  *22  ^(x^) 


where  n  ■  n^  +  ng  . 

For  simplicity  of  notations,  let  us  define  the  following 
orthogonal  projection  operators: 


P 


1 


$ 
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where  the  notation  was  defined  in  Section  2.  The  Information 
matrices  may  now  be  written  as 

'i  -  xii<vPl)Xl1 
/2  -  *21  *2^*21 
/  -  (X£x  :  X2i)Un-p)^ii^ 


We  shall  drop  the  subscripts  for  the  identity  matrices  whenever 
they  are  understood  from  the  context. 

It  is  easy  to  verify  that  /  a  and  /  a  /g.  Hence  infor¬ 
mation  is  nondecreasing  in  the  number  of  observations.  In  the 
following  theorem  we  establish  much  more  -  information  matrices 
gare  shown  to  be  superadditive. 

Theorem  3.1.  *  a  f  ^  (3.1) 

with  equality  iff 


'<5 


ti  y 
*12  *12' 


(3.2) 


Proof:  /  -  -  /2  -  (%  :  X^Kdiag^  ,  'P)/Xll\ 

{hl' 


is  the  orthogonal  projection  operator  onto 


9 


while 
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diag(P,  ,  P«)  is  the  orthogonal  projection  operator  onto 
(X12  0  } 

/  '■  q  ^  /  .  Clearly, 


Thus 


(diag^  ,  P2))P  -  P 

So,  by  Theorem  5.1.3  of  Rao  and  Mitra  (1971)  diagfP^  ,  P2)  -  P 
is  the  orthogonal  projection  operator  onto 

Cf  i)} "  {-« -  «■*> 

Here  for  any  subspace  L,  LA  will  denote  the  space  of  vectors 
orthogonal  to  each  vector  in  L. 

In  particular,  diagCP^  ,  P2)  -  P  is  nonnegative  definite  and 
hence  /  -  -  / 2  *  0  . 

Equality  is  achieved  iff 


(Ol^Pl-V-P)^)  -  0 


which  is  equivalent  to 


(3.3) 


14. 


Note  that  S  consists  of  vectors 


such  that 


(z*)  €  K  02  X22)  •**  ^X12  :X22Kz2)  * 

Z2  *  *22u2  ’  and  X12X12U1  +  *22*22U2  *  0 
(3.5)  is  satisfied  iff  X-j^  +  ^lz2  *  0 


0  .  i.e.. 


for  some 
for  all 


81 

ttl 
81 » 


above  form.  This  is  equivalent  to  the  statement 


-  X^i  , 

and  u2  . 
z2  of  the 


X11X12U1  *21*22U2  *  ®  whenever  ^^2^12^1  ^  X22X22^2  *  ®  (3  *4) 

(3.4)  is  clearly  equivalent  to  (3.2).  Hence  the  theorem. 

Note  that  (3.2)  and  (3*4)  are  equivalent  to; 

XllX12al  *  *21*22*2  whenever  *12*12*1  "  *22*22a2  (3*5) 

These  equations  give  geometric  interpretation  of  equality  in  equa¬ 
tion  (3.1).  In  the  next  theorem  we  give  a  statistical  interpretation 
We  shall  show  that  every  BLUE  in  the  combined  experiment  is  a  sum 
of  some  suitably  chosen  BLUEs  of  the  individual  experiments.  Each 
of  these  BLUEs  may,  however,  be  estimating  different  parametric 
functions . 

Let  »  (All  BLUEs  for  all  estimable  functions  of  01  in 
experiment  1),  i  «  1,2,  and  ?  *  (All  BLUEs  for  all  estimable 
functions  of  6^  in  the  combined  experiment j .  By  Lemma  2.2  (il), 

.  (b'Y|b  c  /[(I-P^X^J)  ,  i  *  1,2  and 
t  -  (b'Yjb  e  jetU-PKXk  :  X£x) ' )  . 
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Theorem  ?.2.  /  -  +  f2  iff  any  b'Y  e  f  can  be  written  as 

h’Y  -  b£Y^  +  bgY2  with  b^Yt  e  9±  ,  1  »  lf2  . 

Proof:  If  +  "then  from  the  proof  of  Theorem  3.1  it 

follows  that 

(xil  !  -  (*ii  :  ^Mdiagd-Pj,  I-P2)). 

Hence  h'd^  :  %)(l-P)T  -  h'x^L(I-P1)Y1+  h'X^1(I-P2)Y2  . 

This  establishes  the  necessity.  To  prove  the  sufficiency, 
suppose  b'Y  c  t  and 

b'Y  -  b{Y1  +  b^Y2  -  (b*  :  b£)Y 

where  b£Y1  €  and  b£Yg  e  ffg  .  Applying  Lemma  2.2  (1)  to  the 
combined  model. 


l.e.,  for  some  h  ,  ^ 


/xn  xi2n 

1X21  XggJ 

!h  + 

(3.6) 

Lh  +  Xggi^ 

(3.7) 

Premultiplying  (3.6)  by  (I-P^  we  get 

(I'Pl)bl  *  (I‘Pl)Xllh 

But  (I-P1)b1  =.  bx ,  since  bx  €  JCffl-P^X^]  and  I-Px  is 
idempotent .  Hence 

bl  *  (^Xllh  * 

Similarly  bg  ■  (I-PgJXg^h  ,  from  equation  (3.7). 

So  b'Y  «  h'X^1(l-P1)Y1  +  h'X^1(l-P2)Y2.  Since  b'Y  can  be  any 
element  in  ?,  this  implies  that  for  each  s,  there  exists  an  h 
(a  function  of  s)  such  that 

8'  (xil  :  *21)(I‘P)Y  -  h,xiid-pi)Yi  +  h'X^1(l-P2)Y2  (3.8 

Talcing  expectation  on  both  sides  of  (3*8),  we  obtain 


s'/  -  h'^  +  h'/2 


Defining 


(3.9) 


/•  * /Qh . 


(3.10 


Since  (3.10)  is  true  for  all  s,  /  ■  /  H  ,  for  some  matrix  H. 


Thus  M)  e  /(/„)  •  Bat  M  J  e  IV)  tnm  (3.1).  H«nce  »e 
have, 

M)-tV0)  <3-13-> 

Prom  (3.10)  we  get, 

h »  /~/s  f°p  some  *;  * 

Now  taking  variances  in  (3.8) »  It  follows  that 

s’/ s  »  h'/jh  +  h'f2h  -  h*/0h  -  s'/J^  S Q  S 0H 

«  s'//q/s  , 

since  # o*o  *  *  *  U8lng  th*  fact  that  *o  is  a 

g-lnverse  of  /Q  ,  the  latter  being  symmetric,  and  using  (3.11). 

The  above  being  true  for  all  s,  and  since  both  /  and 

are  symmetric ,  one  may  write 

//-/./  (3.:*) 

Note  that  this  relation  is  true  for  all  g- inverse*  of  /0  since 
//“/  is  invariant  under  choice  of  /“  (see  Rao  and  Mitra  (1971) 

LWH  2,2.4(111))* 


18. 


Let  T  be  a  nonsingular  matrix  such  that  /  ■  T*  dlag(D,o)T 
and  / Q  »  T'dlag(Do,0)T  .  D  and  DQ  have  the  same  dimension 
and  are  both  nonsingular  due  to  (3.11).  Choose 
/”  »  T“*diag(D^  ,0)T'“*.  Using  these  representations  In  (3.12) 
we  obtain  D  »  DQ  and  hence  /  »  /  . 

In  view  of  Theorem  3.2  ,  the  results  of  Theorem  3.1  can 
be  given  the  following  Interpretation,  f  ■  /^  +  /2  » 

?  c  f x  e  ff2  (3.13) 

the  algebraic  sum  of  f  ^  and  £2  .  If  /  >  +  /2  ,  then 

(3.13)  la  violated.  This  means  that  elements  In  £  -  £^  e  £2 
are  BLUEs  which  cannot  be  computed  from  BLUEs  of  the  original 
experiment.  This  Is  surely  due  to  one  or  both  of  the  following 
conditions : 

C.l.  (Expectation  condition):  There  exists  ape  Jt(J )  which 
cannot  be  written  as  p'6^  *  pjd^  +  with  p1  e  and 

P2  e  /(/2)  .  Note  that  the  column  space  of  an  Information  matrix 
provides  all  estimable  functions. 

C.2.  (Variance  condition):  There  exist  a  p  In  the  set 
{p|p  «  M(J)  *nd  p»ex  -  P{8X  +  p£lx  ,  where  p1  <  /(fl^  and 
P2  e  l(J2)  for  some  p^  ,  pg|  ,  but  for  all  representations  of 
such  a  p,  the  BLUE  of  p'l^  In  combined  experiment  has  a  smaller 
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variance  than  the  sum  of  the  BLUE  of  in  experiment  1  and 

the  BLUE  of  in  experiment  2. 

We  shall  elucidate  the  above  with  some  examples  in  block 
designs.  First  we  state  a  corollary  which  gives  a  general 
characterization  of  /  »  /1  +  /2  for  block  designs. 

We  assume  the  usual  additive  model.  01  consists  of  the 
treatment  effects  and  0g  the  block  effects.  Let  experiment 
1  use  blocks  B^, . . . jB^B^  • • • »Br  and  let  experiment  2  use 
blocks  Bq+1, . . .  . . .  fB^ 8  .  To  keep  our  discussions 

general,  we  assume  that  the  number  of  observations  in  B^  for 
experiment  1  is  k^  and  that  for  experiment  2  is  kg^  ,  and 

need  not  equal  kg^  when  q  +  1  <.  i  <  r  .  Let  ,  i  =  1,2 
denote  the  incidence  matrices  for  the  two  experiments,  i.e., 

“«  »2  ■  %  ■ 

and  Xg2  bas  rfs  columns  each.  Moreover  partition 

H1  *  [N11  *  W12  !  03  M  *2  “  [0  *  M22  s  N233 

where  has  q  columns,  W^g  and  *22  have  r-q  columns 

each,  *23  has  s  columns.  It  is  clear  that  *12*12  " 
dlag(k^^,  •  • . ,k^^,0,  •  •  •  ,0)  ,  X£gXgg  ■  dlag(0, . .  •  ,0,kgq^^,  *  •  •  ,kg^8) 
The  corollary  may  now  be  stated. 


20. 


Corollary  3.1.  Por  a  block  design  }  *  *#1  +  $2  *** 

(dlagCk^i,...,^)^  -  (dlagfk^^,...,^^)^  (3.13) 

Proof?  Prom  equation  (3.2),  /  -  /x  +  /g  Iff  there  exists  a  matrix 
T  such  that 

N1  “  X12X12T  811(1  N2  "  X22X22T 
Partitioning  T'  »  [T£  :  :  T^]  ,  one  obtains 

"  (dlag(k^ ^ » « » « »h^g) )T^  ,  N-jp  ■  (<llag(lt^q^, . , .  )  )^2 

N22  "  (dlas(k2q+i»***»k2r))T2  *  N23  "  (dlas(,t2x*|.i»  •  •  •  »*2rfs^T3  . 

Clearly  and  always  exists,  and  Tg  exists  Iff  (3.13) 

Is  satisfied. 

Thus  }  »  /1  +  /g  Iff  the  designs  In  the  common  blocks  sure 
"proportional"  In  the  sense  of  (3.13).  In  particular.  If 
*11  "  *21  »4+l<l*r,  then  the  designs  should  be  Isomorphic, 
l.e. ,  R^g  »  H£g  .  If  the  two  experiments  have  no  blocks  In  common 
then  obviously  /  ■  ^  +  /g  —  a  fact  which  is  generally  well 
known  to  researchers  in  the  theory  of  optimal  designs. 

Example  3.1.  This  is  a  simple  though  somewhat  pathological 
example.  However,  one  may  easily  extend  It  to  more  nontrivial 
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cases.  Consider  the  following  block  designs  with  Incidence  matrices 


N 


1 


the  two  experiments  using  the  same  treatments  and  blocks.  The 
combined  experiment  has  the  Incidence  matrix 


N  . 


Here  R (f)  ■  r(/1)  »  r(/2)  — there  Is  only  one  estimable  function. 

Moreover  -  f2  .  But  />/,+/„,  by  Corollary  3.1.  This 
Is  because  each  experiment  has  an  observation  confounded  with  the 
second  block.  These  observations  are  released  for  estimation  of 
treatments  when  the  experiments  are  combined.  This  Is  an  example 
of  condition  C2. 


Example  3.2.  Consider  the  following  experiments  using  the  same 
treatments  and  blocks. 


H 


1 


>  2 


/°  1  °\ 
0  10 
101 
Vi  o  l / 


Kerning  the  treatment  effects  ,  1  *  1  s  4  ,  we  see  that  the 
estimable  functions  in  each  experiment  are  -  t2  and  Tj-t^ 

and  their  linear  combinations  only.  r2  ~  becomes  estimable 
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too  upon  combining  the  experiments.  Here  »  /g  but  f  >  /^  + /2  • 
It  will  be  seen  in  the  next  section  that  the  BLUEs  for  -  t2 
and  for  the  combined  experiment  is  a  combination  of  the 

BLUEs  of  the  individual  experiments.  Thus  this  is  an  example  of 
condition  Cl.  A  combination  of  designs  in  Example  1  and  Example  2 
will  give  an  example  of  both  Cl  and  C2. 

In  view  of  Corollary  3.1  it  is  easy  to  construct  examples 
where  f  »  +  /g  .  Note  also  that  f  ~  ^  i  +  ^ 2  w**en  there  are 

no  nuisance  parameters,  i.e.,  Qg  -  0  . 


4.  ON  THE  ADDITIVITY  OF  BLUES  IN  COMBINED  EXPERIMENTS. 

If  two  sets  of  data  are  collected  for  a  fixed  linear  model 
I  then  it  is  obvious  that  one  should  combine  the  data  and  carry  out 

one  analysis.  In  Section  3  we  measured  the  gain  in  combining 
the  data  in  terms  of  information  matrices.  In  this  section  we 
shall  study  the  status  of  the  Gauss -Markov  theorem  for  the  combined 
data  and  explore  its  relation  to  the  associated  Gauss-Markov  ] 

i 

theorems  for  the  individual  sets  of  data.  In  particular,  we  shall 
deal  explicitly  with  the  following  questions: 


* 

« 

i 

Question.  1 

* 

t 

p»ex  which 

i 

conditions , 

BLUE  of  p'e^  under  the  combined  data  -  linear  com-  ^  ^ 
bination  of  BLCEs  of  under  the  individual  sets 

of  data? 
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Question.  2.  Same  as  Question  1,  except  that  we  Insist  that 
the  statement  (4.1)  is  true  for  all  estimable  functions.  Thus 
If  f  Is  any  linear  function  of  6^  estimable  under  the  both 
sets  of  data,  we  want 

BLUE  of  |  under  the  combined  data  =  linear  com¬ 
bination  of  BLUEs  of  4  under  Individual  sets  of  (4.2) 

data. 

We  shall  completely  characterize  cases  under  which  (4.1) 
and  (4.2)  are  valid.  These  characterizations  turn  out  to  be 
mathematically  interesting  and  reveal  more  about  the  structure  of 
the  information  matrices.  These  results  would  be  of  practical 
significance  since  in  the  cases  where  (4.1)  or  (4.2)  are  valid, 

|  a  lot  of  computational  time  can  be  saved  if  the  BLUEs  for  each 

individual  sets  of  data  are  already  available. 

Before  starting  on  these  problems,  we  remark  that  Theorem 
3.2  answers  a  question  similar  to  Question  2.  If  /  *  +  /g  , 

h  is  a  solution  of 


/  h  »  p  ,  (4.3) 

and  p^a/jh  ,  1  >  1,2  then  from  the  proof  of  the  necessity  of 
Theorem  3.2,  (BLUE  p,01  in  combined  data)  a  (BLUE  p£8j.  in 
experiment  l)  +  (BUIE  ^  exPeriment  2).  Note  that  p1 

and  p2  are  invariant  whatever  h,  satisfying  (4.3).  However, 
pi*l  8111(1  P^®i  'nay  not  be  multiples  of  p‘0^  . 
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Let  as  look  at  Question  1.  Suppose  b^Y^  Is  the  BLUE  of 
p'81  in  experiment  i  ,  i  »  1,2  .  Here  p  e  n  /(Zg).  If 

cUjbjYi  +  Ogb^Yg  is  the  BLUE  of  p'91  when  the  data  are  combined, 
the  unbiasedness  condition ^  p'91  •  E(o1b£Y1  +  OgbgYg)  ,  for  all  81  , 
implies  that  °i  +  <*2  *  1  •  Among  all  such  unbiased  estimators 
the  one  with  minimum  variance  has 

V(b£Y2) 

“i  -  vtb^)  tt xsgg  -  °(»ay)  «“>  °2  -  1  -  °  • 


Explicitly, 


P'Zlp 

(X  3S  ■  C - Z— 

P'Z^P  +  P'ZgP 


(4.4) 


So  the  only  linear  combination  which  can  be  BLUE  of  p'9^  under 
the  combined  experiment  is 


b»Y  -  ob^  +  (l-o)b^Yg  .  (4.5) 

The  following  Theorem  answers  Question  1. 


Theorem  4.1.  Let  p  e  /(Z^  n  Jt(Zg)  p  4  0  .  Then  equation 

(4.1)  is  satisfied  iff  there  exists  vectors  h^  and  hg  such  that 

.  op  ,  /jhj  •  (l-o)p 


(♦.6) 
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Xi2Xllhl  +  *12*12*2  "  0  ■  X22X21hl  +  X^2X22h2  “  0  <*-7> 

Proof:  b'Y  in  (4.5)  is  obviously  unbiased.  By  Lemma  2.2  (i) 
it  is  BLUE  of  p'e1  iff 

abl  \  e  J*  11  X12 

a)b2j  *J\x21  X^ 


obx  -  +  P1X11h1  +  X^hg  (4.10) 

where  ■  *\t>  ^*12*12  ^  *12  *  Section  3  *  Premult  ip  lying 

(4.10)  by  X£g  we  get,  since  XJg^  *  0  , 

0  -  X^X^  +  *12*12*2  *  (4.11) 

Premultiplying  (4.11)  by  X^X^X^)'  , 


0  *  +  Xjghg  » 


(4.12) 
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This,  together  with  (4.10)  Implies 

abx  -  (I-PxJXjjhj.  (4.13) 

It  is  easy  to  see  the  equivalence  of  (4.11)  and  (4.12).  There¬ 
fore  (4.11)  and  (4.13)  together  imply  (4.10).  Hence  (4.10)  is 
equivalent  to  ((4.11),  (4.13)}.  Premultiplying  (4.13)  hy 
Xix  ,  we  have,  since  Xil*l  *  P  ' 


op  » 


(4.14) 


By  Lemma  2.2  (ii) 


bl  *  ^I‘P1)X11 t 

for  some  t.  Thus  p  »  X|1(I-P1)X^1 t  .  So  (4.14)  becomes 

*  xii<I-pi>xii  hi  - 


which  lcplies  that 


o(l-P1)Xnt  -  (l^JXj^^  . 

Thus  (4.13)  and  (4.14)  are  equivalent.  Hence  (4.8)  is  equivalent 
to 


Xi2Xllhl  +  Xi2X12h2  *  0 


and 
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■  op 

Similarly  (4.9)  is  equivalent  to 

/2h1  -  (l-a)p  and  X^gXg^  +  X£gXgghg  -  0  . 

Hence  the  theorem. 

Let  us  move  on  to  Question  2.  Here,  we  want  (4.1)  to  be 
valid  for  all  p  in  X(f^)  n  J5(/g)  which  we  assume  to  be  differ¬ 
ent  from  C Oj .  Let  T  be  a  nonsingular  matrix  such  that, 

-  T'diagfD^D^.O.OjT  ,  /g  -  T'diag(Dglf0,Dg2,0)T  (4.15) 

where  the  ' s  are  diagonal  matrices.  Note  that  the  simultaneous 
dlagonalization  of  / ^  and  /g  are  carried  out  in  such  a  way 
that  the  corresponding  component  matrices  in  diag(D11,D1g,0,0) 
and  diag(Dg1,0,D^g,0)  are  of  the  same  dimension.  and  Dg^ 

are  nonsingular.  If  X(f^)  c  -C(/g)  ,  then  there  will  be  no  D^g 
and  we  shall  write  the  matrices  in  (4.15)  with  only  three  diagonal 
blocks.  Similar  modifications  are  needed  when  JC(Sg)  c  Xff^)  . 

If  X(/^)  «  -£(/g)  ,  then  we  need  only  two  diagonal  blocks,  i.e., 

-  T'diag(Dn,0)T  ,  /g  -  T*diag(D21,0)T  (4.16) 

Considering  only  the  general  case  when  X(fj)  0  X(f g)  contains 
neither  X(f  nor  X{f^)  *  **  may  insist  that  D^g  and  Dgg 
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are  nonsingular.  The  results  for  the  special  cases  mentioned 
above  may  be  obtained  from  the  general  results  with  obvious 
modifications  -  we  shall  only  consider  the  case  ■  /(/2) 

separately,  as  this  is  of  special  Interest . 

Partitioning,  T*  -  (T£  :  T£  :  Tj  s  T£)  ,  we  get  from  (*.15)  , 

fl  -  -PlA  +  T^12T2  •  >2  -  TiD21Tl  +  TiD22TJ 

Defining 

/ll  -  T{DnTl  *  ^12  “  T2D12T2  ’  ^21  *  T1D21T1  » ^22  *  T3D22T3 

we  obtain 

*l"^ll  +  ^12  *  ^2*^21 +  ^22  (*.17) 

M-u)  (1/y  -  (0)  ,  /tf21)  nJ?(/22)  -  (0} 

(*.18) 

Aif^)  n  W22)  -  (0)  .  /Cfn)  -  .W21)  -  Jj(f4)  n  4(fa) . 

From  Theorem  3.1  It  follows  that 

/•/2jL  +  ^i2+^21  +  ^22  +  R 
where  R  is  a  nonnegative  definite  matrix. 

Theorem  *.2.  BLUE  of  p*0.  in  the  combined  experiment 
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«  a(BCUE  of  p'e1  In  experiment  l) 

+  (l-*a)(BLDE  of  p'e^  In  experiment  2)  (4.19) 

for  all  pc  /(/2)  n  /(/2)  ,  Iff 

/21  -  f0T  ®«ne  X  >  0  (4.20) 

and 

/  -  +  /21  +  Rq  (4.21) 

with 

R0  ■  /jg  +  /22  +  R  satisfying 

WnLn  /(R0)  -  (0J  (4.22) 

Proof:  Suppose  (4.19)  Is  given.  Then  (4.6)  is  satisfied  for 
each  p  e  /(Z^)  -  jC(/21)  .  The  equation 

*  09 

can  be  written  equivalently  as 

^11  ^1  *  ap  »  ^12  *1  "  0 


using  the  facts  that  p  c  /Cfu)  and  /(/n)  n  /(/^g)  -  (o)  . 
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So,  equation  (4.6)  implies 

/-q -  op  and  *  U“«)  P  » 

and  this  is  true  for  all  p  e  jC(/^)  .  Lama  2.3  now  gives 

/ 21  *  X^u  »  for  *<»«  X  >  0 
which  is  (4.20).  Observe  that, 

p’/;p  p'/^p 

a  »  ■  —  -  -  ■  *  "  ■  ■  ■  ■ "  ■  -  ■  ■  * 

P’/^W/gP  P'/^lH-P’/giP 

since  by  Lemma  2.4,  p‘/j[p  ■  P'/j^P  ,  i  -  1,2 
pendent  of  p . 

Taking  variances  on  both  sides  of  equation  (4.19),  one 
obtains,  for  each  p  e  /(/^)  , 

P'/"P  -  o2p'/£p  +  (l-a)2p '/gP 

-  o^p'/^p  +  (l-o)2p'/2iP 

-  (l+X)^'/^  p  using  (4.23) 

-  P'^oP  * 

where  /e  *  +  /2x  “  (l+xXfj^  *  Since  this  is  true  for  all 

P  «  M0)  ,  »•  ««t 


(4.23) 

1+X 

.  Thus  o  is  inde 
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o~2  V(BLUE  of  p'e^  in  combined  experiment) 

«  p«/"p 

-  p’/;p 

using  the  "(i)  ■»  (ii)"  part  of  Lemma  2.4,  and  (4.22).  This 
being  the  same  as  (4.24),  we  appeal  to  the  uniqueness  of  the 
BLUE  in  the  classical  Oauss-Markov  model  to  conclude  that  the 
estimator  b'Y  is  the  BLUE  in  the  combined  model.  Hence  we 
get  equation  (4.19). 

Remark  4.1.  Under  conditions  of  the  Theorem, 

f  -'o+Ro  • 

Using  Lena  2.4  and  equation  (4.22),  we  get  p"/“p  -  p’S~p  , 
for  each  p  e  JUf0)  and  p'/“p  «  p*R“p  *  for  each  p  c  /(Rc)  . 
In  this  sense  i  and  R^  are  themselves  information  matrices. 
Thus  (4.21)  gives  a  decomposition  of  /  into  two  Information 
matrices  whose  spaces  are  disjoint.  In  a  sense  the  combined 
experiment  can  be  looked  upon  as  the  union  of  two  disjoint 
(fictitious)  experiments ,  one  consisting  of  the  common  part  of 
the  individual  experiments  (corresponds  to  /Q),  and  the  other 
consisting  of  the  remainder  of  the  individual  experiments 
(corresponds  to  $22)  lnd  ***  portion  which  is  purely 

the  profit  from  combining  (corresponding  to  R).  Also  note  that 
(4.22)  implies 


Remark  4.2.  Since  ^  *  12  0  ^(/ 12^  *  * 

/  and  f2i  &re  themselves  Information  matrices,  by  Lemma  2.4. 
/u  is  the  Information  matrix  of  parametric  functions  common  to 
both  experiments  and  that  of  function  which  can  be  estimated 

from  experiment  1  only.  A  similar  explanation  holds  for  /gi 
and  /22  .  Thus  when  the  conditions  of  the  Theorem  are  valid, 
estimable  functions  In  /(/1)  n  .€(/2)  are  estimated  by  a  linear 
combination  of  the  BLUEs .  Estimable  function  from  .f(/12)  are 
nonestlmable  In  experiment  2,  but  thlr  BLUEs  may  use  observations 
from  experiment  2  when  the  experiments  are  combined.  This  is 
because  0  /(R)  need  not  be  (0).  A  similar  observation 

holds  for  functions  from  /(/22)  . 

As  an  example,  consider  the  following  block  designs,  given 
by  their  Incidence  matrices 


The  combined  experiment  has  the  Incidence  matrix 

(1  1  1  1  0  o\ 

1  1  0  0  0  0 

0  0  0  0  2  2/ 

Straightforward  computations  show  that  f  m  ^21  *  diag(0,0,A) , 
where  A  -  *J)  .  and 
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rs  -3  *i  -i  o  on 
-35-1-100 
1  -1-13-100 
¥-1-1-1300 
0  0  0  0  4-4 
Lo  0  0  0  -4  4. 


Clearly  the  conditions  of  the  theorem  are  satisfied.  Hence  the 
BLUE  of  -  Tg  is  a  linear  combination  of  the  BLUEs  In  the 
original  experiments .  can  be  estimated  from  experiment 

2  only,  -  Tg  is  estimable  in  experiment  1  only,  but  its  BLUE 
in  the  combined  experiment  uses  observations  from  experiment  2 
also.  This  is  due  to  the  fact  that  the  observations  in  block  2 
in  each  individual  experiment  cannot  be  used  due  to  confounding, 
but  are  released  when  they  are  combined,  Tg  -  can  be  estimated 
only  in  the  combined  experiment.  We  give  this  somewhat  oversimple - 
fled  example  to  illustrate  Theorem  4.2  in  a  way  that  can  be  seen 
obviously  without  getting  involved  into  complicated  structures  of 
the  matrices  associated  with  problem. 

In  the  following  theorem,  the  conditions  of  Theorem  4.2  are 
expressed  in  a  form  which  is  better  suited  for  computational  veri¬ 
fication.  We  give  a  general  form  which  must  be  simplified  in  partic¬ 
ular  settings  in  which  an  experimenter  is  interested. 

Theorem  4.3.  Equation  (4.19)  is  satisfied  for  all  p  in 
nJ?(/2)  iff 

/gi  *  X/ji  »  tor  some 

and 

PUha 


X  >  0  . 


(4.20) 


*22*21 


(*•25) 
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where  T_1  »  Q'  ■  (Q£  :  :  Q^)  ,  T  being  defined  in  (4.15), 

and  the  partitions  corresponding  to  those  of  T'. 


Proof;  As  noted  in  the  proof  of  Theorem  4.2,  (4.6)  is  equivalent  to 


f  ^j^h^  ~  ®P  *  *  21^1  * 

and 

/l2hl  =»  0  ,  /22hl  3  0 


(4.26) 

(4.27) 


since  p  «  £(S^)  .  Equation  (4.26)  can  be  replaced  by 

/2i  *  1/n  .  for  some  X  >  0  ,  /11h1  -  ctp  (4.28) 


So  (4.19)  is  equivalent  to  the  following  statement: 

For  each  p  c  jCCf^)  ,  there  exists  ,  hg  such 

that  (4.27),  (4.28)  and  (4.7)  are  satisfied.  (4.29) 

Note  that  (4.28)  must  imply  that  a  *  (l+X  )-1.  Let 
Th^  ■  s  ,  (T*  )“^p  •  q  and  partition  s*  -  (s£  :  s£  :  s^  :  sj^) 
and  q'*  (q£  :  q^  :  q^  :  q£).  A 8  in  the  proof  of  Lemma  2.5, 
qg  ,  q^  and  q^  are  all  null  vectors  since  p  c  /(/u)  • 

Ovserve  that 


/nhi  -  op  •  L11»1  -  oq1  •  s1  -  oD^ 


4 


j 
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For  each  p  c  JUf±i)  *  there  exists  hg  sad  s^ 

such  that  /2 1  *  *  ****  110110  «  *  0 ,  sad 

(4.30)  sad  (4.31)  are  satisfied.  (4.32) 


p  varies  over  Iff  Is  any  arbitrary  vector,  which 

is  true  Iff  s^  *  (l-fX)"LD{Jq1  is  any  arbitrary  vector. 
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s2  and  oust  be  0  as  we  have  already  noted.  So,  (4.32)  Is 

equivalent  to 


/2i  **  X  •  *or  some  X  >  0  ,  and  there  exists 
matrices  H  and  S  such  that 


Xi2XllQi  =  X12X12H  +  Xi2XllQ4S 

X22X21^i  “  X22X22H  +  HA lHS  * 


(4.33) 


Clearly  (4.33)  is  equivalent  to  (4.20)  and  (4.25). 

A  case  of  special  interest  is  when  both  the  experiments  are 
designed  to  have  the  same  estimable  functions  of  0^  .  i.e.. 


JKfj.)  -  /(/2)  (4.34) 

In  this  case  »  /22  »  0  ,  u  ,  /2  -  f21  ,  RQ  -  R. 

Theorems  4.2  and  4.3  can  be  restated  with  some  simplifications 
when  (4.34)  is  satisfied.  We  do  these  in  Corollaries  4.1  and  4.2. 

Corollary  4,1.  If  /(Z^)  «  /Cf2)  »  then  for  each  p  6  /(/^)  , 

BLUE  of  p'0^  in  the  combined  experiment  is  a  linear  combination 
of  the  BEXJEs  of  p'e^  in  the  individual  experiments  iff 

/2  «  X/i  for  some  X  >  0 

and 

/  •  +  /2  +  R  with  /(/1)  n  M R)  -  (0)  . 
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Corollary  4.2.  If  .€(/ 1)  =  jK(/g)  then  equation  (4.19)  Is 
satisfied  for  all  p  e  iff 

/2  »  X/l  for  some  X  >  0 

and 

c  Xi2Xliai 

WAl0!/  'V&fe  X22X21tIa 

where  U'  =  (U^  :  U£)  is  the  inverse  of  T,  which  is  any  non¬ 
singular  matrix  diagonalizing  ,  i.e., 

»  T'  d iag(DltO)T  , 

and  T'  »  (T£  :  T£)  .  Both  and  U£  have  as  many  columns 
as  r(f  j) . 

As  illustrations  consider  the  examples  in  Section  3 •  In 
Example  3.1,  »  /g  but  the  conditions  of  Corollary  4.2  are 

violated.  Hence  the  BLUE  of  t^-Tg  is  not  a  linear  combination 
of  BLDEs  of  the  individual  experiment.  Clearly,  the  observations 
in  Block  2  in  the  individual  experiments  can  only  be  used  when 
the  data  are  combined. 

In  Example  3.2,  *  /g  and  the  conditions  of  Corollary 

4.2  are  satisfied.  The  BLDEs  of  T^-Tg  and  are  obtained 

as  linear  combinations  from  the  individual  experiments. 

However  t2 ~  t ^  is  estimable  only  when  the  data  are  combined . 
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Thus  to  obtain  Its  BLUE  the  entire  data  has  to  be  put  together 
and  analysed.  This  leads  us  to  the  next  question.  Under  what 
conditions  Is  (4.1)  satisfied  for  all  p  €  /(/)  ,  if 
MS )  *  JC(/1)  »  /(/ 2)  .  The  following  corollary  answers  this 
as  a  special  case  of  Corollary  4.1. 

Corollary  4.3.  If  MS±)  »  MS2)  *  MS)  »  then  for  each 
P  e  /(/)  ,  the  BLUE  of  p'8^  In  the  combined  experiment  is  a 
linear  combination  of  the  BLUEs  of  p'e^  In  the  Individual 
experiments  Iff 

/2  *  XS^  for  seme  \  >  0 

and 

/  •  s1  +  s2. 

Proof:  Follows  from  Corollary  4.1  since  if  /*/,+/„  +  R  , 
and  /(/)  -  -  MS2)  »  then  f2  -  X^  and 

MSX)  n  /(R)  -  (0)  iff  f2  -  x/i  and  R  -  0. 

For  the  sake  of  completeness,  we  state  the  version  of 
Corollary  4.2  when  /(/)  -  MS±)  »  J VJ2)  ,  In  Corollary  4.4. 

The  proof  follows  directly  from  Theorem  3.1. 

Corollary  4.4.  If  /(/)  -  MS))  *  MS2)  »  then  equation  (4.19) 
is  satisfied  for  all  p  c  MS)  iff 
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/2  »  \/ 1  for  sane  X  >  0 

and 

J*khi\  _  .(Hshs) 

1*22*21/  J\*22*22/ 

Examples  where  these  results  are  valid  may  be  easily  obtained 
by  considering  two  block  designs  with  the  same  incidence  matrix, 
and  having  isomorphic  designs  In  their  common  blocks. 

Finally,  we  remark  that  In  many  cases  superadditivity  of 
Information  matrices  Is  due  to  the  fact  that  some  observations 
may  not  be  used  in  the  BI-UEs  based  on  individual  experiments  due 
to  confounding,  but  may  be  released  and  utilised  for  estimating 
linear  functions  of  0^  when  the  data  from  the  experiments  are 
combined.  This  is  clear  from  the  examples  considered  in  this  * 
and  the  previous  section. 

5.  COWCLDDIWO  REMARKS. 

In  this  report  we  have  established  the  superadditivity  of 
information  matrices  and  explored  conditions  for  additivity.  We 
have  also  found  conditions  when  BLUEs  for  the  combined  experiment 
can  be  computed  simply  from  the  BLUEs  of  the  Individual  experiments. 
The  conditions  provide  geometrical  and  statistical  insight  into 
problems  associated  with  combining  experiments.  The  results  are 
for  a  general  linear  model.  In  any  particular  setting  they  may 
have  to  be  translated  to  a  more  readily  verifiable  form.  As  an 


example,  we  may  refer  to  Corollary  3.1  where  the  conditions  for 
the  equality  f  ■  +  /g  have  been  expressed  in  a  very  elemen¬ 

tary  form  for  the  block  design  setup. 

Ve  hope  that  the  results  in  this  report  may  be  profitably 
utilized  in  further  research.  As  an  illustration  we  can  consider 
a  problem  of  extending  an  experiment  optimally.  Suppose  an 
experimenter  has  conducted  an  expert ement  for  comparing  treatments 
in  b  block  of  size  k  each.  Then  suppose  he  is 

given  funds  to  conduct  another  experiment  of  the  same  nature  to 
improve  on  his  findings.  Suppose  he  has  the  choice  of  either 
using  new  blocks  ,  or  using  the  old  blocks  again. 

Suppose  the  experiment  is  such  that  the  old  blocks  have  no 
residual  effect  from  the  earlier  experiment,  l.e. ,  the  model 
remains  the  same.  The  question  is,  how  should  the  experimenter 
extend  the  experiment  to  get  the  best  results?  What  should  be 
his  new  design? 

Let  / 1  be  the  information  matrix  of  the  old  design, 
the  information  matrix  of  the  extension  and  f  the  information 
matrix  of  the  combined  experiment.  If  the  experimenter  uses 
new  blocks,  then  surely  $  »  +  /g  .  In  fact,  if  he  uses 

some  or  all  of  the  old  blocks,  then  also  /  “  ^1  +  ^2  M 
as  the  designs  in  the  common  blocks  are  isomorphic.  Thus,  he 
should  use  the  same  blocks  and  try  not  to  have  the  new  design 
Isomorphic  to  the  old  one.  Obviously,  if  the  old  design  can 
be  extended  to  an  optimal  design  in  b  blocks  of  size  2k  each 


(for  example,  a  Balanced  Block  Design),  then  the  information  in 
the  combined  experiment  has  been  "maximized".  Otherwise,  one 
suspects  that  in  general  the  extension  should  be  as  far  as  possible 
from  designs  which  sure  isomorphic  to  the  old  design. 

Bow ' suppose  the  experimenter  has  only  two  choices:  to  repeat 
the  same  experiment  (same  design)  in  new  blocks  or  in  the  old 
blocks.  Clearly  the  information  matrix  is  the  same  in  both  cases, 
viz  +  /2  ,  where  ,#1  and  have  their  usual  meanings. 

But  if  he  uses  old  blocks  he  saves  a  lot  of  degrees  of  freedom 
which  would  be  otherwise  used  for  the  new  block  effects.  He  can 
use  them  to  test  the  validity  of  his  model.  Alternately,  he  can 

o 

use  this  to  get  a  better  estimate  of  a  ,  the  measurement  error. 
In  this  case,  though  he  is  estimating  the  estimable  treatment 
parameters  with  the  same  precision  as  he  would  have  if  he  uses 
new  blocks,  his  estimate  of  the  variance  of  estimate  improves 
considerably.  We  may  remark  that  unfortunately  the  precision  of 
estimators  of  as  has  been  suggested  by  Fisher  (1971)  has 
been  largely  ignored  in  the  literature  of  optimal  design  of 
experiments . 
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